Abstract

We analyze the performance of CSMA in multi-channel wireless networks, accounting for
the random nature of traffic. Specifically, we assess the ability of CSMA to fully utilize the
radio resources and in turn to stabilize the network in a dynamic setting with flow arrivals and 
departures. We prove that CSMA is optimal in ad-hoc mode but not in infrastructure mode, 
when all data flows originate from or are destined to some access points, due to the inherent 
bias of CSMA against downlink traffic. We propose a slight modification of CSMA, that we 
refer to as flow-aware CSMA, which corrects this bias and makes the algorithm optimal in all 
cases. The analysis is based on some time-scale separation assumption which is proved valid 
in the limit of large flow sizes. 

Keywords: Wireless network, interference graph, CSMA, flow-level dynamics, time-scale 
separation, stability. 

1 Introduction

The CSMA (Carrier Sense Multiple Access) algorithm is a key component of IEEE 802.11 networks. 
While it proves successful in sharing a single radio channel between a limited number of stations, its 
efficiency is questionable in more involved environments with multiple radio channels and a large 
number of stations having different interference constraints. In this paper, we analyse the ability of 
CSMA to fully utilize the radio resources in such environments, in both ad-hoc and infrastructure 
modes, accounting for the random nature of traffic. Specifically, each station attempts to access 
a randomly chosen radio channel after some random backoff time and transmits a packet over 
this channel if it is sensed idle. We study the random variations of the number of active wireless 
links induced by this random access algorithm and the random activity of users. In particular, we 
analyse the ergodicity of the associated Markov process, which characterizes the ability of CSMA 
to stabilize the network. 

It turns out that, while CSMA is always efficient in ad-hoc mode, in the sense that the network 
is stable whenever possible, it is generally inefficient in infrastructure mode, when all data flows 
originate from or are destined to some finite set of access points. This is due to the inherent bias of 
CSMA against downlink traffic, from the access points to the stations: each access point attempts 
to access the radio channels with the same rate, independently of the number of active downlink 
flows at this access point. We prove that a slight modification of CSMA, which consists in running 
one instance of CSMA per flow at each access point, corrects this bias and makes the algorithm 
optimal. We refer to this algorithm, introduced in [6], as flow-aware CSMA.

The rest of the paper is organized as follows. We present some related work in the next section. 
The network model in ad-hoc mode is described in section 3. Sections 4 and 5 are devoted to the
packet- and flow-level dynamics, respectively, assuming time-scale separation. The main result
of the paper, given in Theorem 1, shows in particular the optimality of CSMA in ad-hoc mode.
The validity of the time-scale separation assumption is discussed in section 6. The infrastructure
mode is considered in section 7, where we prove the suboptimality of standard CSMA and the
optimality of flow-aware CSMA. Section 8 concludes the paper.

2 Related work 

The present work is related to the problem of optimal scheduling in wireless networks. While a
centralized solution has been known since the seminal work of Tassiulas and Ephremides, who proved in
[24] the optimality of the maximum weight policy, no distributed solution was known until the
recent works of Jiang, Ni, Shah and Walrand [TTJ [T2l ED]. These authors considered a simple 
CSMA algorithm whereby the attempt rate of each station depends either on the number of 
queued packets or on some local estimates of the arrival rate and the service rate of packets at 
the station. Similar ideas are used by Ni, Tan and Srikant in [18]. The proof of optimality relies
on the fact that these adaptive versions of CSMA achieve the maximum weight scheduling, under 
some technical assumptions related to the speed of convergence of the algorithm. In practice, the 
algorithm must indeed be carefully designed so as to enforce the time-scale separation, as shown 
for instance in the recent paper of Proutiere, Yi, Lan and Chiang [19].

All these papers focus on the packet-level dynamics, assuming packets are generated by some 
fixed number of flows. The flow-level dynamics are ignored, whereas they are known to be critical,
see for instance [1, 2, 3, 16] in the context of wireline networks. As in our previous paper [6], we
consider both the packet- and flow-level dynamics, under the usual assumption that the former
are much faster than the latter. Specifically, we extend the results of [6] to multi-channel networks
in both ad-hoc and infrastructure modes and discuss the validity of the time-scale separation 
assumption. 

Surprisingly, little attention has so far been paid to multi-channel networks. A notable
exception is the adaptive, multi-channel version of CSMA introduced in [19], which is shown to
maximize the network utility when combined with some appropriate virtual queue mechanism.
We here prove the optimality of CSMA in the sense of flow-level stability for a very general model 
where the interference constraints may depend on the considered channel and each transmitter 
may only use a subset of the channels. Specifically, we show that it is sufficient for each transmitter 
to probe one of its channels at random, without any further information on the network state. 

Another salient feature of this paper is the observation of the key difference between the ad-hoc 
and infrastructure modes. In the former, the number of transmitters grows with the congestion, 
which increases the channel attempt rate and in turn stabilizes the network. This is not the case 
of the latter since the channel access opportunities of each access point must be shared by all 
downlink flows at this access point. This inherent bias of CSMA against downlink traffic is well 
known, see e.g. [10l EE], and can be easily corrected by letting the attempt rate of each access 
point depend on the number of downlink flows, a scheme we refer to as flow-aware CSMA [6]. The 
algorithm is then optimal. 

3 Model 

3.1 A multi-channel wireless network 

The network consists of a random, dynamic set of wireless links in ad-hoc mode (there is no 
access point at this stage). These links must share some finite number J of non-interfering radio 
channels. Each link consists of a transmitter-receiver pair; the transmitter is able to use at most 
one radio channel at a time. We group links into a finite number K of classes, as illustrated by
Figure 1. All links within the same class have the same radio conditions, the same interference
constraints and the same CSMA parameters. We denote by $x_k$ the number of class-$k$ links and
by $x$ the corresponding vector, which we refer to as the network state. Two links within the same
class cannot be simultaneously active on the same channel. An active class-$k$ link on channel $j$
transmits data at the physical rate $\varphi_k$ bit/s, independently of $j$. We say that class $k$ is active on
channel $j$ if there is an active class-$k$ link on channel $j$.




Figure 1: An ad-hoc wireless network with 4 classes of links and its interference graph.

Each channel $j$ is associated with some conflict graph $G_j = (V_j, E_j)$, where $V_j \subset \{1, \ldots, K\}$ is
the set of classes that are able to transmit on channel $j$ and $E_j$ is the set of edges, each representing
a conflict. Specifically, two classes $k, l \in V_j$ can be simultaneously active on channel $j$ if and only
if they do not conflict with each other, that is if $(k, l) \notin E_j$. The J conflict graphs are typically
the same but could differ due to different radio propagation environments on the J channels, or
to different transmission capabilities of the K classes.

3.2 Feasible schedules 

We refer to a schedule as any vector $y \in \{0, 1\}^{K \times J}$, where $y_{kj} = 1$ if class $k$ is active on channel
$j$. We denote by $y_k$ the number of active class-$k$ links:

$$ y_k = \sum_{j=1}^{J} y_{kj}. $$

The schedule is feasible if for all $j = 1, \ldots, J$, the active classes on channel $j$ belong to $V_j$ and do
not conflict with each other, that is $y_{kj} y_{lj} = 0$ for all $(k, l) \in E_j$. Moreover, we must have:

$$ \forall k = 1, \ldots, K, \quad y_k \le x_k. \qquad (1) $$

We denote by $\mathcal{Y}(x)$ the set of feasible schedules. Note that if $x_k \ge J$ for all $k = 1, \ldots, K$, the
constraint (1) is no longer limiting (since the number of active class-$k$ links is limited by the
number of radio channels J) and the set of feasible schedules becomes independent of the network
state. We denote by $\mathcal{Y}$ the corresponding set, which is the union of $\mathcal{Y}(x)$ over all network states
$x$.
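
The feasibility conditions above can be checked mechanically on small instances. The following Python sketch enumerates the feasible schedules by brute force; the function name, the 0-based indexing and the 3-class line network used as an example are illustrative assumptions, not part of the model.

```python
from itertools import product

def feasible_schedules(x, J, allowed, conflicts):
    """Enumerate the feasible schedules of section 3.2 by brute force.

    x         -- tuple (x_1, ..., x_K) of link counts per class
    J         -- number of radio channels
    allowed   -- allowed[j] is the set V_j of classes able to use channel j
    conflicts -- conflicts[j] is the set E_j, given as frozenset({k, l}) pairs
    Classes and channels are 0-indexed here (an implementation choice).
    """
    K = len(x)
    schedules = []
    for bits in product((0, 1), repeat=K * J):
        # y[k][j] = 1 if class k is active on channel j
        y = tuple(bits[k * J:(k + 1) * J] for k in range(K))
        if any(sum(y[k]) > x[k] for k in range(K)):
            continue  # violates constraint (1): y_k <= x_k
        feasible = True
        for j in range(J):
            active = [k for k in range(K) if y[k][j]]
            if any(k not in allowed[j] for k in active):
                feasible = False  # some active class is outside V_j
            if any(frozenset((k, l)) in conflicts[j]
                   for a, k in enumerate(active) for l in active[a + 1:]):
                feasible = False  # two conflicting classes share channel j
        if feasible:
            schedules.append(y)
    return schedules


# Hypothetical example: 3 classes on a line (0-1 and 1-2 conflict), 2 channels.
line = {frozenset((0, 1)), frozenset((1, 2))}
everyone = {0, 1, 2}
small = feasible_schedules((1, 1, 1), 2, [everyone] * 2, [line] * 2)
large = feasible_schedules((2, 2, 2), 2, [everyone] * 2, [line] * 2)
print(len(small), len(large))
```

In this example the state (2, 2, 2) satisfies $x_k \ge J$ for all classes, so the second call returns the state-independent set of schedules, of which the first set is a subset.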

3.3 Capacity region 

Assume that each feasible schedule $y$ is selected with probability $\pi(y)$, with $\sum_{y \in \mathcal{Y}} \pi(y) = 1$. The
mean throughput of class $k$ is then given by:

$$ \phi_k = \varphi_k \sum_{y \in \mathcal{Y}} y_k \pi(y). \qquad (2) $$

Let $\phi$ be the corresponding throughput vector. We refer to the capacity region as the set of vectors
$\phi$ generated by all probability measures $\pi(y)$, $y \in \mathcal{Y}$. Note that the capacity region depends both
on the physical rates and on the interference constraints of all wireless links.

4 Packet-level dynamics 

We first analyze the packet-level dynamics induced by CSMA for a static network state x. The 
flow-level dynamics that make $x$ vary are introduced in section 5.

4.1 Random access



We consider the standard CSMA algorithm where each transmitter waits for a period of random 
duration referred to as the backoff time before each transmission attempt. At each attempt, the 
transmitter chooses a radio channel at random and probes it. If the radio channel is sensed idle 
(in the sense that no conflicting link is active), a packet is transmitted (we neglect
collisions); otherwise, the transmitter waits for a new backoff time
before the next attempt.

Packets have random sizes of unit mean and are transmitted at the physical rate $\varphi_k$ on class-$k$
links; the backoff times of class-$k$ transmitters are random with mean $1/\nu_k$, where $\nu_k > 0$ is the
corresponding attempt rate. We denote by $\alpha_k = \nu_k/\varphi_k$ the ratio of the mean packet transmission
time to the mean backoff time of class-$k$ links. Channel $j$ is chosen with probability $\beta_{kj}$, with
$\sum_{j=1}^{J} \beta_{kj} = 1$ and $\beta_{kj} > 0$ if and only if $k \in V_j$, so that all accessible channels are attempted with
positive probability.

4.2 Stationary distribution 

Let $Y(t)$ be the schedule selected by the above random access algorithm at time $t$. We look for the
stationary distribution of $Y(t)$, which we denote by $\pi(x, y)$ to highlight the fact that it depends
on the network state $x$. We have:

Proposition 1. If both the packet sizes and the backoff times have exponential distributions, then
$Y(t)$ is a reversible Markov process, with stationary measure:

$$ w(x, y) = \prod_{k : x_k > 0} \frac{x_k!}{(x_k - y_k)!} \, \alpha_k^{y_k} \prod_{j=1}^{J} \beta_{kj}^{y_{kj}}. \qquad (3) $$

Proof. Let $e_{kj}$ be the unit vector on component $(k, j)$ of $\{0, 1\}^{K \times J}$. The Markov process $Y(t)$
jumps from state $y$ to state $y + e_{kj}$ with rate $(x_k - y_k)\nu_k\beta_{kj}$ (since all idle links attempt to access
the channel) and from state $y + e_{kj}$ to state $y$ with rate $\varphi_k$ (since all class-$k$ links have the same
physical rate $\varphi_k$, independently of the used channel), for any state $y$ such that $y + e_{kj} \in \mathcal{Y}(x)$.
The proof then follows from the local balance equations:

$$ w(x, y)(x_k - y_k)\nu_k\beta_{kj} = w(x, y + e_{kj})\varphi_k. $$

□

The stationary distribution $\pi(x, y)$ follows from the normalization of the stationary measure
$w(x, y)$ over all $y \in \mathcal{Y}(x)$. We deduce the mean throughput of class $k$ in state $x$:

$$ \phi_k(x) = \varphi_k \sum_{y \in \mathcal{Y}(x)} y_k \pi(x, y). \qquad (4) $$

It turns out that, by the insensitivity property of the underlying loss network [5], these expressions
are in fact valid for any phase-type distributions of packet sizes and backoff times; such distributions
are known to form a dense subset within the set of all distributions with real, non-negative support
[23], so that the results hold for virtually any distributions of packet sizes and backoff times. We
refer the reader to [25] for further details on this insensitivity property.
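
As a numerical illustration of the stationary measure (3) and the throughput (4), the sketch below evaluates both for an explicitly listed set of feasible schedules. The function names and the two-class, single-channel example are assumptions made for illustration only.

```python
from math import factorial, prod

def stationary_weight(x, y, alpha, beta):
    """Unnormalized stationary measure (3) of schedule y in state x."""
    w = 1.0
    for k, xk in enumerate(x):
        yk = sum(y[k])  # number of active class-k links
        if yk == 0:
            continue  # idle classes contribute a factor 1
        w *= factorial(xk) / factorial(xk - yk) * alpha[k] ** yk
        w *= prod(beta[k][j] ** y[k][j] for j in range(len(y[k])))
    return w

def mean_throughput(x, schedules, alpha, beta, rate):
    """Mean throughput (4) of each class, given the feasible schedules of x."""
    weights = [stationary_weight(x, y, alpha, beta) for y in schedules]
    total = sum(weights)
    return [rate[k] * sum(w * sum(y[k]) for w, y in zip(weights, schedules)) / total
            for k in range(len(x))]

# Hypothetical example: two conflicting classes sharing a single channel.
schedules = [((0,), (0,)), ((1,), (0,)), ((0,), (1,))]
x, alpha, beta, rate = (2, 1), (1.0, 2.0), ((1.0,), (1.0,)), (1.0, 1.0)
throughput = mean_throughput(x, schedules, alpha, beta, rate)
print(throughput)
```

Here the three schedules have weights 1, 2 and 2, so each class is active a fraction 2/5 of the time. The weight ratio between the empty schedule and the schedule with class 0 active equals $x_0 \alpha_0 \beta_{00}$, as required by the local balance equations.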

5 Flow-level dynamics

We now introduce the flow-level dynamics under the assumption of infinitely fast packet-level 
dynamics; the validity of this time-scale separation assumption is discussed in section 6.

5.1 Traffic characteristics



We assume that flows using class-$k$ links are generated according to a Poisson process of intensity
$\lambda_k$. Each such flow has an exponential size with mean $\sigma_k$ bits and leaves the network once the
corresponding data transfer is completed. There is a one-to-one correspondence between flows and
links so that both terms are used interchangeably in the following. We denote by $\rho_k = \lambda_k\sigma_k$ the
traffic intensity of class $k$ (in bit/s) and by $\rho$ the corresponding vector.

Under the time-scale separation assumption, the flow-level dynamics are much slower than the
packet-level dynamics so that, at the time scale of a flow, everything happens as if the stationary
distribution (3) of the packet-level dynamics were reached instantaneously. In particular, the mean
throughput of class $k$ is given by (4) in state $x$.

5.2 Stability region 

Let $X_k(t)$ be the number of class-$k$ flows at time $t$. The corresponding vector $X(t)$ describes the
evolution of the network state. This is a Markov process with transition rates $\lambda_k$ from state $x$ to
state $x + e_k$ and $\phi_k(x)/\sigma_k$ from state $x$ to state $x - e_k$ (provided $x_k > 0$), where $e_k$ denotes the
unit vector on component $k$.
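
The transition structure of $X(t)$ can be written down directly from this description. The sketch below returns the outgoing rates of a state for any throughput function; the helper names and the closed-form throughput of the two-class, single-channel example (obtained from (3) and (4) with unit physical rates) are illustrative assumptions.

```python
def flow_transitions(x, lam, sigma, throughput):
    """Transition rates of the flow-level process X(t) out of state x."""
    phi = throughput(x)
    rates = {}
    for k in range(len(x)):
        arrival = tuple(v + 1 if i == k else v for i, v in enumerate(x))
        rates[arrival] = lam[k]  # class-k flow arrival
        if x[k] > 0:
            departure = tuple(v - 1 if i == k else v for i, v in enumerate(x))
            rates[departure] = phi[k] / sigma[k]  # class-k flow completion
    return rates

def two_class_throughput(x, alpha=(1.0, 1.0)):
    """Throughput (4) for two conflicting classes, one channel, unit rates."""
    load = [xk * ak for xk, ak in zip(x, alpha)]
    return [l / (1.0 + sum(load)) for l in load]

rates = flow_transitions((2, 0), lam=(0.3, 0.4), sigma=(1.0, 2.0),
                         throughput=two_class_throughput)
print(rates)
```

In state (2, 0) there are two possible arrivals and a single possible departure, since class 1 has no ongoing flow.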

We say that the network is stable if this Markov process is ergodic. Clearly, a necessary
condition for stability is that the vector of traffic intensities $\rho$ lies in the capacity region. The
following key result of the paper shows that this condition is in fact sufficient, up to the critical
case where $\rho$ lies on the boundary of the capacity region. In this sense, CSMA is optimal in the
considered ad-hoc mode.

Theorem 1. The network is stable for all vectors of traffic intensities $\rho$ in the interior of the
capacity region.

The proof is deferred to the appendix. It is based on the fact that the random access algorithm
selects schedules in proportion to their weights (3). For large $x$, this is equivalent to selecting
schedules in proportion to the following uniform weight, which is independent of the channel
probing distribution:

$$ u(x, y) = \prod_{k : x_k > 0} (x_k \alpha_k)^{y_k}, \quad y \in \mathcal{Y}(x). \qquad (5) $$

Defining:

$$ u(x) = \max_{y \in \mathcal{Y}(x)} u(x, y), $$

the following result, also proved in the appendix, shows that those schedules of maximum weight 
are actually selected with probability close to 1: 

Lemma 1. For any $\epsilon > 0$, we have:

$$ \sum_{y \in \mathcal{Y}(x)} \pi(x, y) \log(u(x, y)) \ge (1 - \epsilon) \log(u(x)) $$

for all states $x$ but some finite number.

The result then follows from the stable behavior of maximum weight scheduling, except that
the latter is defined over the set of all feasible schedules. Defining the corresponding weight by:

$$ v(x) = \max_{y \in \mathcal{Y}} u(x, y), $$

the following result, proved in the appendix, shows that it is essentially the same as $u(x)$:

Lemma 2. We have:

$$ \sup_{x \in \mathbb{N}^K} \frac{v(x)}{u(x)} < \infty. $$

The proof of Theorem 1, based on Lemmas 1 and 2, then follows from Foster's criterion.

6 Time-scale separation



Theorem 1 is based on the time-scale separation assumption: in the packet-level model of section
4, packets "see" a fixed number of flows, while in the flow-level model of section 5, flows "see" the
equilibrium state of packet-level dynamics. In this section, we remove this assumption.
Specifically, we prove that when the size of the flows grows, the model without time-scale separation
converges to the model with time-scale separation, which indeed suggests that CSMA is optimal
for sufficiently large flow sizes. We actually conjecture that CSMA is optimal for any flow size,
which we prove at the end of the section for a specific class of networks.

6.1 Scaling 

As in section 5.1, class-$k$ flows are assumed to arrive according to a Poisson process of intensity
$\lambda_k$. The number of packets per class-$k$ flow has a geometric distribution with mean $N\sigma_k$, where
$N$ is some positive integer that we refer to as the scaling parameter. In particular, each class-$k$ flow
terminates with probability $1/(\sigma_k N)$ after each packet transmission. Packets are assumed to have
an exponential size with mean $1/N$ bits, so as to keep the class-$k$ mean flow size constant and
equal to $\sigma_k$ bits. In particular, the corresponding traffic intensity $\rho_k = \lambda_k\sigma_k$ is independent of $N$.

The random access algorithm is that described in section 4.1. The only difference is that the
attempt rates must be scaled so as to keep the ratio of mean packet transmission time to mean
backoff time constant. Thus each class-$k$ link now attempts to access the channels at rate $N\nu_k$.

6.2 Asymptotic time-scale separation 

The state of the network is now described by the couple $(X^N(t), Y^N(t))$, where $X^N(t)$ gives the
number of flows of each class at time $t$ and $Y^N(t)$ the schedule that is selected at time $t$. This
is a Markov process with the following transition rates: $\lambda_k$ from state $(x, y)$ to state $(x + e_k, y)$
(class-$k$ flow arrival); $N(x_k - y_k)\nu_k\beta_{kj}$ from state $(x, y)$ to state $(x, y + e_{kj})$ (access to channel $j$
by a class-$k$ flow); $N y_{kj}\varphi_k(1 - 1/(\sigma_k N))$ from state $(x, y)$ to state $(x, y - e_{kj})$ (packet transmission
of a class-$k$ flow over channel $j$, without flow completion); and $y_{kj}\varphi_k/\sigma_k$ from state $(x, y)$ to state
$(x - e_k, y - e_{kj})$ (packet transmission of a class-$k$ flow over channel $j$, with flow completion).

When $N$ grows, the packet-level dynamics, represented by $Y^N(t)$, are accelerated with respect
to the flow-level dynamics, represented by $X^N(t)$. The following result, proved in the appendix,
shows that there is indeed time-scale separation between the packet level and the flow level in the
limit. We assume that $X^N(0) = X(0)$ for all $N \ge 1$.

Theorem 2. When $N \to \infty$, the stochastic process $X^N(t)$ converges in distribution to the Markov
process $X(t)$, which describes the network state under the time-scale separation assumption.

6.3 Stability of some class of networks 

Theorems 1 and 2 suggest that CSMA is optimal for sufficiently large flow sizes. We conjecture that
CSMA is actually optimal for any flow size, in the sense that the Markov process $(X^N(t), Y^N(t))$
is ergodic for any scaling parameter $N \ge 1$ provided the vector of traffic intensities $\rho$ lies in
the interior of the capacity region. To support this conjecture, consider the following class of
networks. We assume that all links have access to the J channels. The interference graph is the
same on all channels and given by some L-partite graph, i.e. there exists a partition $\{C_1, \ldots, C_L\}$
of $\{1, \ldots, K\}$ such that two classes in $C_i$ do not interfere with each other but a class in $C_i$ does
interfere with all classes in $\{1, \ldots, K\} \setminus C_i$. Examples of L-partite graphs are given in Figure 2.
The following result, proved in the appendix, shows that CSMA is optimal independently of the
scaling parameter $N$:

Proposition 2. Any network with an L-partite interference graph is stable for all vectors of traffic
intensities $\rho$ in the interior of the capacity region.

(a) $C_1 = \{3\}$, $C_2 = \{1, 2, 4, 5\}$.
(b) $C_1 = \{1, 2, 3\}$, $C_2 = \{4, 5, 6\}$.
(c) $C_1 = \{1, 2\}$, $C_2 = \{3, 4\}$, $C_3 = \{5\}$.

Figure 2: Examples of 2-partite (a)-(b) and 3-partite (c) graphs.
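
Whether a given interference graph is L-partite in the above sense can be tested by grouping classes that do not interfere and then checking every pair. The following Python sketch does so; the function name and the edge-list encoding are assumptions, and the bow tie edges are those implied by the description of Example 2 in section 7.2.

```python
def multipartite_partition(nodes, edges):
    """Return the partition {C_1, ..., C_L} if the graph is L-partite in the
    sense of section 6.3 (no edge inside a part, all edges across parts),
    and None otherwise."""
    adjacent = {frozenset(e) for e in edges}
    parts = []
    for v in nodes:
        for part in parts:
            # v joins a part if it does not interfere with its first member
            if frozenset((v, part[0])) not in adjacent:
                part.append(v)
                break
        else:
            parts.append([v])
    index = {v: i for i, part in enumerate(parts) for v in part}
    for a, u in enumerate(nodes):
        for v in nodes[a + 1:]:
            same_part = index[u] == index[v]
            if same_part == (frozenset((u, v)) in adjacent):
                return None  # edge inside a part, or missing edge across parts
    return parts

# Graph (c) of Figure 2: every pair of classes in different parts interferes.
fig_c_parts = [[1, 2], [3, 4], [5]]
fig_c_edges = [(u, v) for a, p in enumerate(fig_c_parts)
               for q in fig_c_parts[a + 1:] for u in p for v in q]
# Bow tie of Example 2 (edges assumed from its description).
bow_tie = [(1, 2), (1, 3), (2, 3), (3, 4), (3, 5), (4, 5)]

print(multipartite_partition([1, 2, 3, 4, 5], fig_c_edges))
print(multipartite_partition([1, 2, 3, 4, 5], bow_tie))
```

Under these assumptions the bow tie is not L-partite (classes 4 and 5 both avoid class 1 yet interfere with each other), so Proposition 2 does not cover it.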



7 Infrastructure-based networks 

We have so far considered a network in ad-hoc mode, without infrastructure. We now consider 
N access points to which users must connect. In particular, each class now corresponds either to 
uplink traffic (from the users to an access point) or to downlink traffic (from an access point to the 
users). We study the flow-level dynamics of CSMA under the time-scale separation assumption.
Specifically, we prove the suboptimality of standard CSMA in this context and introduce a slight
modification of CSMA, which we refer to as flow-aware CSMA, that makes the algorithm optimal.



7.1 Uplink vs. downlink 

For all $i = 1, \ldots, N$, we denote by $U_i$ and $D_i$ the sets of uplink and downlink classes, respectively,
associated with access point $i$. In the example of Figure 3 for instance, there are N = 2 access
points and K = 6 classes, with $U_1 = \{2\}$, $D_1 = \{1, 3\}$, $U_2 = \{5\}$ and $D_2 = \{4, 6\}$. An access
point cannot transmit and receive on the same channel. In particular, those classes sharing the
same access point, either in uplink or downlink, conflict with each other. Formally, for all access
points $i = 1, \ldots, N$ and all classes $k, l \in U_i \cup D_i$, we have $(k, l) \in E_j$ for each channel $j$ such that
$k, l \in V_j$. We assume that an access point cannot transmit data on more than one channel at a
time but is able to receive data on the J channels simultaneously.




Figure 3: A network of 2 access points with 6 classes of links and its interference graph. 

The feasible schedules are those defined in section 3.2 with the additional constraint that each
access point cannot transmit data on more than one channel at a time, that is:

$$ \forall i = 1, \ldots, N, \quad \sum_{k \in D_i} y_k \le 1. \qquad (6) $$

We denote by $\mathcal{Y}(x)$ the set of feasible schedules and by $\mathcal{Y}$ the union of $\mathcal{Y}(x)$ over all network states
$x$. The corresponding capacity region is defined as in section 3.3.



7.2 Standard CSMA 

We first consider the standard CSMA algorithm: each transmitter waits for a period of random 
duration before attempting transmission on some randomly chosen channel. The key difference 
with the ad-hoc wireless network considered so far is that each access point runs a single instance 
of the CSMA algorithm for all its downlink traffic. In particular, for each access point $i$, the
attempt rates $\nu_k$ are the same for all classes $k \in D_i$. At each attempt, the access point $i$ selects a
class-$k$ flow with some probability proportional to $x_k$ and probes channel $j$ with probability $\beta_{kj}$.
If the probed channel is sensed idle, a packet of this flow is transmitted.

It is worth noting that the attempt rate of each access point is independent of its congestion 
level, in terms of the number of ongoing downlink flows at this access point. This breaks the natural 
stabilizing effect of CSMA we have proven in Theorem 1 in the context of ad-hoc networks, where
those classes with a higher number of flows get preferential access to the radio channels. In the
following, we illustrate the suboptimality of standard CSMA on two examples with downlink traffic
only. Note that, in the presence of uplink traffic only, the model is in fact equivalent to the ad-hoc
network considered so far.

For this purpose, we give the distribution of feasible schedules achieved by the algorithm under
the time-scale separation assumption. Denoting by $Y(t)$ the schedule at time $t$, we have the
analogue of Proposition 1:

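Proposition 3. If both the packet sizes and the backoff times have exponential distributions, then
$Y(t)$ is a reversible Markov process, with stationary measure:

$$ w(x, y) = \prod_{i=1}^{N} \left[ \prod_{k \in U_i : x_k > 0} \frac{x_k!}{(x_k - y_k)!} \, \alpha_k^{y_k} \prod_{j=1}^{J} \beta_{kj}^{y_{kj}} \right] \left( \frac{1}{\sum_{k \in D_i} x_k} \right)^{\sum_{k \in D_i} y_k} \prod_{k \in D_i : x_k > 0} (x_k \alpha_k)^{y_k} \prod_{j=1}^{J} \beta_{kj}^{y_{kj}}. \qquad (7) $$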

Proof. As for Proposition 1, the proof follows from the local balance equations. For all $i = 1, \ldots, N$,
we have:

$$ \forall k \in U_i, \quad w(x, y)(x_k - y_k)\nu_k\beta_{kj} = w(x, y + e_{kj})\varphi_k, $$

and

$$ \forall k \in D_i, \quad w(x, y) \frac{x_k}{\sum_{l \in D_i} x_l} \nu_k\beta_{kj} = w(x, y + e_{kj})\varphi_k. $$

□

The stationary distribution of the schedules $\pi(x, y)$ follows from normalization. Again, it is
insensitive to the packet size and backoff time distributions beyond the means. The throughput
of class $k$ is given by (4).



Example 1 The simplest example showing the suboptimality of CSMA is shown in Figure
4. It consists of N = 3 access points, a single class per access point and a single channel. Taking
unit physical rates, the optimal stability region is $\rho_1 + \rho_2 < 1$ and $\rho_2 + \rho_3 < 1$, where 1 and 3 are
the edge classes and 2 is the center class. We have proven in [6] that the actual stability region is
strictly smaller, even in the limiting case of infinite attempt rates.

Figure 4: Network of 3 access points with a single downlink class per access point and its
interference graph.

Example 2 Consider the multi-channel network of Figure 5 with N = 5 access points, a single
class per access point and J = 2 channels, further referred to as the bow tie network. The conflict
graph is the same for both channels. We refer to class 3 as the center class and to the other classes
as the edge classes. We assume that the mean packet sizes and the mean backoff times are the
same for all classes, so that $\alpha_k = \alpha$ for all $k = 1, \ldots, 5$, for some $\alpha > 0$. We also assume that
all classes except class 3 have the same traffic intensities. The optimal stability condition is then
given by:

$$ \rho_3 < 1 \quad \text{and} \quad 2\rho_1 + \rho_3 < 2. \qquad (8) $$




Figure 5: Network of 5 access points with a single downlink class per access point and its
interference graph.

We consider the limiting case where $\alpha \to \infty$ and we assume that the two channels are chosen
uniformly at random. We then deduce from (4) and (7) the following throughput vector:



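$$ \phi(x) = \begin{cases}
(1, 1, 0, 1, 1) & \text{if } x_1, x_2, x_4, x_5 > 0, \\
\left(\frac{3}{4}, \frac{3}{4}, \frac{1}{2}, 1, 0\right) & \text{if } x_1, x_2, x_3, x_4 > 0, \ x_5 = 0, \\
\left(\frac{2}{3}, \frac{2}{3}, \frac{2}{3}, 0, 0\right) & \text{if } x_1, x_2, x_3 > 0, \ x_4 = x_5 = 0, \\
(0, 1, 1, 1, 0) & \text{if } x_2, x_3, x_4 > 0, \ x_1 = x_5 = 0, \\
(1, 1, 0, 0, 0) & \text{if } x_1, x_2 > 0, \ x_3 = x_4 = x_5 = 0, \\
(1, 0, 0, 0, 0) & \text{if } x_1 > 0, \ x_2 = 0, \ x_3 = x_4 = x_5 = 0.
\end{cases} \qquad (9) $$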
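
The entries of this throughput vector can be checked by enumeration. With a single class per access point and uniform channel choice, the measure (7) gives every schedule with $n$ active classes a weight proportional to $(\alpha/2)^n$, so that in the limit $\alpha \to \infty$ only the schedules of maximum size remain and they are equally likely. The sketch below relies on this reading; the edge list of the bow tie and the function name are assumptions, and unit physical rates are taken.

```python
from fractions import Fraction
from itertools import product

# Conflict edges of the bow tie (assumed from the text: class 3 is the center).
EDGES = [(1, 2), (1, 3), (2, 3), (3, 4), (3, 5), (4, 5)]

def limiting_throughput(active, channels=2):
    """Throughput vector when schedules of maximum size are equally likely.

    active -- set of classes k with x_k > 0
    Each access point is either idle (0) or transmits on one channel (1..J).
    """
    classes = sorted(active)
    best, best_size = [], -1
    for choice in product(range(channels + 1), repeat=len(classes)):
        assign = dict(zip(classes, choice))
        clash = any(assign.get(k, 0) != 0 and assign.get(k, 0) == assign.get(l, 0)
                    for k, l in EDGES)
        if clash:
            continue  # two conflicting classes on the same channel
        size = sum(1 for c in choice if c != 0)
        if size > best_size:
            best, best_size = [assign], size
        elif size == best_size:
            best.append(assign)
    # Fraction of maximum-size schedules in which each class is active.
    return tuple(Fraction(sum(1 for s in best if s.get(k, 0) != 0), len(best))
                 for k in range(1, 6))

print(limiting_throughput({1, 2, 3, 4}))
print(limiting_throughput({1, 2, 3}))
```

For the active set {1, 2, 3, 4}, for instance, there are 8 schedules with 3 active classes; the center class is active in 4 of them, which gives its throughput of 1/2.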



The other cases follow by symmetry. The center class is in conflict with all other classes for 
accessing the channels and is either not served when the 4 other classes are active or served at a 
low rate when 3 other classes are active. This also results in a suboptimal stability region: 

Proposition 4. The bow tie network is unstable whenever: 

p^>\p\-\p\-\pi + ^ (10) 
This proposition is proven in the appendix. In the homogeneous case $\rho_1 = \rho_3$ for instance,
Proposition 4 implies that the network is unstable whenever $\rho_1 > 0.63$. In view of (8), the optimal
stability condition is $\rho_1 < 2/3$, which shows that the standard CSMA algorithm is not optimal.
This suboptimality is illustrated by Figure 6, the actual stability condition being obtained by the
simulation of the underlying Markov process. In the homogeneous case for instance, the loss of
efficiency is around 15%.




Figure 6: Stability region of the bow-tie network with two channels under standard CSMA
(horizontal axis: edge link load).



7.3 Flow-aware CSMA 

In flow-aware CSMA, each access point runs one instance of the standard CSMA algorithm
per flow. This compensates for the inherent bias of standard CSMA against downlink flows
and stabilizes the network whenever possible. Indeed, the stationary measure of the schedules is
now given by (3). The only difference with the ad-hoc wireless network considered in section 5 is
the additional constraint (6) on the set of feasible schedules. This does not change the proof of
Theorem 1, showing the optimality of flow-aware CSMA.
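
The difference between the two algorithms can be summarized by the rate at which an idle access point attempts transmission for each of its downlink classes. The sketch below is a minimal illustration, assuming a common attempt rate `nu` for all downlink classes of the access point; the function name is hypothetical.

```python
def downlink_attempt_rates(x, nu, flow_aware):
    """Per-class attempt rates of an idle access point.

    x          -- numbers of downlink flows per class at this access point
    nu         -- attempt rate of one CSMA instance (assumed common to all classes)
    flow_aware -- True for one CSMA instance per flow, False for a single instance
    """
    total = sum(x)
    if total == 0:
        return [0.0 for _ in x]
    if flow_aware:
        # One instance per flow: the class-k rate grows with x_k.
        return [nu * xk for xk in x]
    # Single instance: rate nu, split in proportion to x_k.
    return [nu * xk / total for xk in x]

standard = downlink_attempt_rates((3, 1), nu=2.0, flow_aware=False)
aware = downlink_attempt_rates((3, 1), nu=2.0, flow_aware=True)
print(sum(standard), sum(aware))
```

With 4 downlink flows, the aggregate attempt rate stays at `nu` under standard CSMA but reaches `4 * nu` under flow-aware CSMA, which is the mechanism that restores the preferential access of congested classes.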

8 Conclusion 

We have proved that, under the time-scale separation assumption, the distributed scheduling 
achieved by standard CSMA exploits the radio resources in an optimal way in ad-hoc wireless 
networks. This is not the case in the presence of access points, due to the inherent bias of CSMA 
against downlink traffic. A slight modification of CSMA we refer to as flow-aware CSMA is then 
sufficient to correct this bias and to make the algorithm optimal. 

The analysis relies on a number of simplifying assumptions that we plan to relax in future work. 
First, we have neglected the impact of packet collisions; these could be included in the model, as 
done in [13] for rate-based adaptive CSMA for instance. One may then account for the adaptive
backoff of the IEEE 802.11 protocol, which is key in practice to limit the number of collisions. 
Other issues that may be worth addressing concern the traffic model. We have neglected the impact 
of acknowledgements, which are known to be critical in IEEE 802.11 networks. The impact of 
real-time traffic should also be considered. Finally, one may think of multi-hop networks where 
the flows of some source-destination pairs must go through one or several relay nodes. Although 
we believe that flow-aware CSMA is still optimal in this more general setting, we have not yet
been able to prove this result. 






From a more theoretical perspective, one may relax the assumption of Poisson flow arrivals
and exponential flow sizes in the stability analysis. One may for instance consider user sessions
that consist of an alternating series of file transfers and idle periods. We would also like to extend
Proposition 2 to any interference graph, which would prove the validity of Theorem 1 in the
absence of the time-scale separation assumption.



Appendix 

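Proof of Lemma 1 For any class $k$, let:

$$ \beta_k = \min_{j : \beta_{kj} > 0} \beta_{kj}. $$

Note that $\beta_k > 0$. We have for all $y \in \mathcal{Y}(x)$:

$$ w(x, y) \ge \prod_{k : x_k > 0} \frac{x_k(x_k - 1) \cdots (x_k - y_k + 1)}{x_k^{y_k}} \, \beta_k^{J} \, u(x, y). $$

If $x_k < 2J$, we have:

$$ \frac{x_k(x_k - 1) \cdots (x_k - y_k + 1)}{x_k^{y_k}} \ge \frac{1}{x_k^{y_k}} \ge \frac{1}{(2J)^J}. $$

Otherwise, we have using the fact that $y_k \le J$ for all $k = 1, \ldots, K$:

$$ \frac{x_k(x_k - 1) \cdots (x_k - y_k + 1)}{x_k^{y_k}} \ge \left( \frac{x_k - y_k + 1}{x_k} \right)^{y_k} \ge \frac{1}{2^J}. $$

Combining these results, we obtain the existence of some constant $m > 0$ such that:

$$ \forall y \in \mathcal{Y}(x), \quad w(x, y) \ge m \, u(x, y). $$

Now let:

$$ Z(x) = \left\{ y \in \mathcal{Y}(x) : \log(u(x, y)) \ge \left(1 - \frac{\epsilon}{2}\right) \log(u(x)) \right\}. $$

We have:

$$ \sum_{y \in \mathcal{Y}(x)} \pi(x, y) \log(u(x, y)) \ge \left(1 - \frac{\epsilon}{2}\right) \log(u(x)) \sum_{y \in Z(x)} \pi(x, y). $$

Using the fact that $w(x, y) \le u(x, y)$ for all $y \in \mathcal{Y}(x)$, we get:

$$ \sum_{y \in \mathcal{Y}(x) \setminus Z(x)} \pi(x, y) = \frac{\sum_{y \in \mathcal{Y}(x) \setminus Z(x)} w(x, y)}{\sum_{y \in \mathcal{Y}(x)} w(x, y)} \le \frac{1}{m} \frac{\sum_{y \in \mathcal{Y}(x) \setminus Z(x)} u(x, y)}{u(x)} \le \frac{1}{m} \frac{M u(x)^{1 - \epsilon/2}}{u(x)} = \frac{1}{m} \frac{M}{u(x)^{\epsilon/2}}, $$

where $M$ denotes the total number of schedules (that is, the cardinal of $\mathcal{Y}$). Since $u(x)$ tends to
$+\infty$ when $|x| = \sum_k x_k$ tends to $+\infty$, this quantity is less than $\epsilon/2$ for all states $x$ but some finite
number. In those states, we have:

$$ \sum_{y \in Z(x)} \pi(x, y) \ge 1 - \frac{\epsilon}{2}. $$

We deduce that in all states $x$ but some finite number:

$$ \sum_{y \in \mathcal{Y}(x)} \pi(x, y) \log(u(x, y)) \ge \left(1 - \frac{\epsilon}{2}\right)^2 \log(u(x)) \ge (1 - \epsilon) \log(u(x)). $$

□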

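Proof of Lemma 2 Let:

$$ v(x, y) = \prod_{k : x_k \ge J} (x_k \alpha_k)^{y_k}. $$

There are some positive constants $m$, $M$ such that:

$$ \forall x \in \mathbb{N}^K, \ \forall y \in \mathcal{Y}, \quad m \le \frac{u(x, y)}{v(x, y)} \le M. $$

The proof then follows from the fact that:

$$ v(x) = \max_{y \in \mathcal{Y}} u(x, y) \le M \max_{y \in \mathcal{Y}} v(x, y) = M \max_{y \in \mathcal{Y}(x)} v(x, y) \le \frac{M}{m} \max_{y \in \mathcal{Y}(x)} u(x, y) = \frac{M}{m} u(x). $$

□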

Proof of Theorem 1 If the vector of traffic intensities lies in the interior of the capacity region, 
there exist some e > and some probability measure it on y such that: 

Vk = l,...,K, p k = ip k (l-2e)J2n(y)y k . (11) 

y&y 

Note that we can choose w(y) > for all y e y. 
Define the Lyapunov function: 

x k a k 



F(x)= V \og(x k a k )- 

-■ — * 

The corresponding drift is given by: 



k:x k >0 Lfk 



AF(x)=J2*k(F(x + e k )-F(x)) + ^-(F(x-e k )-F(x)), 

k k:xk>0 

= Y —\°g( a k)+ Y —(( x k + l)iog((x k + l)a k )-x k \og(x k a k )) 

k:x k =0 L/?k k:x k >0 ^ k 

+ Y {{xk - 1) log((a;fc - l)afe) - x k \og{x k a k )) . 

k-.x k >o ipk 

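In particular, we have $\Delta F(x) = G(x) + H(x)$ with:

$$G(x) = \sum_{k : x_k > 0} \frac{\rho_k - \phi_k(x)}{\varphi_k} \log(x_k \alpha_k),$$

$$H(x) = \sum_{k : x_k > 0} \frac{\rho_k}{\varphi_k} (x_k + 1) \log\left(1 + \frac{1}{x_k}\right) + \sum_{k : x_k > 0} \frac{\phi_k(x)}{\varphi_k} (x_k - 1) \log\left(1 - \frac{1}{x_k}\right) + \sum_{k : x_k = 0} \frac{\rho_k}{\varphi_k} \log(\alpha_k),$$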
where we use the convention $0 \log(0) = 0$. Since $\phi_k(x) \le J \varphi_k$, the function $H(x)$ is bounded.
Regarding $G(x)$, it follows from the expression of the throughput $\phi_k(x)$ and from (11) that:

$$\begin{aligned}
G(x) &= \sum_{y \in \mathcal{Y}} \left((1 - 2\epsilon)\pi(y) - \pi(x,y)\right) \sum_{k : x_k > 0} y_k \log(x_k \alpha_k), \\
&= \sum_{y \in \mathcal{Y}} \left((1 - 2\epsilon)\pi(y) - \pi(x,y)\right) \log(u(x,y)).
\end{aligned}$$

By Lemma 1, we have for all states x but some finite number: 

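$$\begin{aligned}
G(x) &\le -\epsilon \sum_{y \in \mathcal{Y}} \pi(y) \log(u(x,y)) + (1 - \epsilon) \left(\sum_{y \in \mathcal{Y}} \pi(y) \log(u(x,y)) - \log(u(x))\right), \\
&\le -\epsilon \sum_{y \in \mathcal{Y}} \pi(y) \log(u(x,y)) + (1 - \epsilon) \log\left(\frac{v(x)}{u(x)}\right).
\end{aligned}$$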



Since $\pi(y) > 0$ for all $y \in \mathcal{Y}$, the first term tends to $-\infty$ when $|x| = \sum_k x_k$ tends to $+\infty$.
By Lemma 2, the second term is bounded. We deduce the existence of some $\delta > 0$ such that
$\Delta F(x) \le -\delta$ for all states $x$ but some finite number. The proof then follows from Foster's
criterion. □
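
The split of the drift into $G(x) + H(x)$ used in this proof can be checked numerically. The sketch below is only an illustration: it assumes the Lyapunov function reads $F(x) = \sum_{k: x_k > 0} (\sigma_k/\varphi_k)\, x_k \log(x_k \alpha_k)$, that arrivals occur at rate $\lambda_k$ and departures at rate $\phi_k(x)/\sigma_k$, and it uses arbitrary numbers. The names `rate` (for $\varphi_k$) and `thr` (a stand-in for $\phi_k(x)$) are hypothetical.

```python
import math

def lyapunov(x, alpha, sigma, rate):
    """F(x) = sum over k with x_k > 0 of (sigma_k / rate_k) * x_k * log(x_k * alpha_k)."""
    return sum(s / r * n * math.log(n * a)
               for n, a, s, r in zip(x, alpha, sigma, rate) if n > 0)

def drift(x, lam, alpha, sigma, rate, thr):
    """Drift of F computed directly from the arrival and departure rates."""
    base = lyapunov(x, alpha, sigma, rate)
    total = 0.0
    for k, n in enumerate(x):
        up = list(x)
        up[k] += 1
        total += lam[k] * (lyapunov(up, alpha, sigma, rate) - base)
        if n > 0:
            down = list(x)
            down[k] -= 1
            total += thr[k] / sigma[k] * (lyapunov(down, alpha, sigma, rate) - base)
    return total

def g_term(x, lam, alpha, sigma, rate, thr):
    """G(x): the part of the drift that grows like log(x_k * alpha_k)."""
    return sum((lam[k] * sigma[k] - thr[k]) / rate[k] * math.log(n * alpha[k])
               for k, n in enumerate(x) if n > 0)

def h_term(x, lam, alpha, sigma, rate, thr):
    """H(x): the remaining part of the drift."""
    total = 0.0
    for k, n in enumerate(x):
        rho = lam[k] * sigma[k]
        if n == 0:
            total += rho / rate[k] * math.log(alpha[k])
        else:
            total += rho / rate[k] * (n + 1) * math.log(1 + 1 / n)
            if n > 1:  # for n == 1 the term is 0 * log(0) = 0 by convention
                total += thr[k] / rate[k] * (n - 1) * math.log(1 - 1 / n)
    return total

# Arbitrary illustrative values, not taken from the paper.
x = [0, 1, 3]
lam = [0.3, 0.2, 0.5]
alpha = [2.0, 0.5, 1.5]
sigma = [1.0, 2.0, 0.5]
rate = [1.0, 1.5, 2.0]
thr = [0.0, 0.7, 1.2]  # any values work as long as thr[k] = 0 when x[k] = 0

args = (x, lam, alpha, sigma, rate, thr)
gap = drift(*args) - (g_term(*args) + h_term(*args))
print(abs(gap) < 1e-9)
```

The identity holds for any throughput values, which is why the proof only needs the bound on $\phi_k(x)$ to control $H(x)$ and the structure of $\pi(x, y)$ to control $G(x)$.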



Proof of Theorem 2 In the following, we consider $(X^N(t))_{N \ge 1}$ as a sequence of stochastic
processes in the space $\mathcal{D}_{\mathbb{N}^K}([0, \infty))$ of càdlàg functions with values in $\mathbb{N}^K$ with the Skorohod
topology.

First, we have to prove the tightness of the sequence $(X^N(t))$. It is enough to remark that, for
all $N \ge 1$, $X_k^N(t)$ is stochastically dominated by a Poisson process of intensity $\lambda_k$ and stochastically
dominates an M/M/1 queue with arrival rate $\lambda_k$ and service rate $\varphi_k/\sigma_k$. Thus, the conditions of
the Arzelà-Ascoli theorem are fulfilled and the sequence $(X^N(t))$ is tight (see [4, Th. 12.3]).

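We now consider a bounded function $f$ on $\mathbb{N}^K$. Denote by $\Omega^N$ the infinitesimal generator of
the Markov process $(X^N(t), Y^N(t))$. For $x \in \mathbb{N}^K$ and $y \in \mathcal{Y}$, we have

$$\Omega^N(f)(x, y) = \sum_{k=1}^K \lambda_k \left(f(x + e_k) - f(x)\right) + \sum_{k=1}^K \frac{\varphi_k}{\sigma_k} \sum_{j=1}^J y_{kj} \left(f(x - e_k) - f(x)\right).$$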

According to the martingale characterization of Markov jump processes (see [22]), the process:

$$M_f^N(t) = f(X^N(t)) - f(X^N(0)) - \int_0^t \Omega^N(f)(X^N(s), Y^N(s)) \, ds$$

is a local martingale and, since the process $X^N(t)$ is not exploding on $[0,t]$ (it is stochastically
dominated by a Poisson process), it is a martingale.
For each $N \ge 1$, define the random measure:

$$\Gamma^N([0, t] \times B) = \int_0^t 1_{\{Y^N(s) \in B\}} \, ds, \quad \text{for } B \subset \mathcal{Y}.$$

$\Gamma^N$ is a random variable with values in the set $\mathcal{L}(\mathcal{Y})$ of the measures $\mu$ on $[0, \infty) \times \mathcal{Y}$ such
that $\mu([0, t] \times \mathcal{Y}) = t$ for all $t \ge 0$. Since $\mathcal{Y}$ is finite, the set $\mathcal{L}(\mathcal{Y})$ is compact and
then the sequence $(\Gamma^N)_{N \ge 1}$ is relatively compact.

Assume that the sequence $(X^N(t), \Gamma^N)_{N \ge 1}$ tends to some limit $(X(t), \Gamma)$. Since:

$$\int_0^t \Omega^N(f)(X^N(s), Y^N(s)) \, ds = \int_0^t \sum_{y \in \mathcal{Y}} \Omega^N(f)(X^N(s), y) \, \Gamma^N(ds \times dy)$$

and $f$ is bounded, this random variable tends in distribution to:

$$\int_0^t \sum_{y \in \mathcal{Y}} \Omega^N(f)(X(s), y) \, \Gamma(ds \times dy).$$



It remains to characterize $\Gamma$. According to Lemma 1.3 of [15], there exists a set of random
probability measures $\vartheta(t, \cdot)$ on $\mathcal{Y}$ such that:

$$\Gamma([0, t] \times B) = \int_0^t \vartheta(s, B) \, ds, \quad \text{for } B \subset \mathcal{Y}.$$



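For any function $g$ on $\mathcal{Y}$, we define the martingale:

$$M_g^N(t) = \frac{1}{N} \left( g(Y^N(t)) - g(Y^N(0)) - \int_0^t \Omega^N(g)(X^N(s), Y^N(s)) \, ds \right).$$

For $x \in \mathbb{N}^K$ and $y \in \mathcal{Y}$, we have:

$$\Omega^N(g)(x, y) = \sum_{k=1}^K \sum_{j=1}^J \Big( N (x_k - y_k) \nu_k \beta_{kj} \left(g(y + e_{kj}) - g(y)\right) + N y_{kj} \varphi_k \left(g(y - e_{kj}) - g(y)\right) \Big).$$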

The increasing process of this martingale is: 

«(<)) = ±J*n N ( 9 )(X N ( S ),Y N (s))as, 
It ( 

< — max I q(y) I max 09 1 + max i/t max B k « 
N yey \ k k k,j J 

It tends to 0 on all compact sets so that the martingale tends in distribution to 0. Since $\mathcal{Y}$ is
finite, $g$ is bounded and $(g(Y^N(t)) - g(Y^N(0)))/N$ also tends to 0. Finally, we get that:

$$\frac{1}{N} \int_0^t \Omega^N(g)(X^N(s), Y^N(s)) \, ds$$

converges in distribution to 0. This implies:

$$\int_0^t \sum_{y \in \mathcal{Y}} \left( \sum_{k=1}^K \sum_{j=1}^J (X_k(s) - y_k) \nu_k \beta_{kj} \left(g(y + e_{kj}) - g(y)\right) + y_{kj} \varphi_k \left(g(y - e_{kj}) - g(y)\right) \right) \vartheta(s, y) \, ds = 0$$

and for almost every $s$ in $[0, t]$, we have:

$$\sum_{y \in \mathcal{Y}} \left( \sum_{k=1}^K \sum_{j=1}^J (X_k(s) - y_k) \nu_k \beta_{kj} \left(g(y + e_{kj}) - g(y)\right) + y_{kj} \varphi_k \left(g(y - e_{kj}) - g(y)\right) \right) \vartheta(s, y) = 0.$$

The probability distribution $\vartheta(s, \cdot)$ is then the stationary distribution given by (3).
It follows that:

$$\int_0^t \Omega^N(f)(X^N(s), Y^N(s)) \, ds$$

converges in distribution to:

$$\int_0^t \Omega(f)(X(s)) \, ds,$$

where $\Omega$ is the infinitesimal generator of the Markov process described in Section 5. For $x \in \mathbb{N}^K$,
we have

$$\Omega(f)(x) = \sum_{k=1}^K \lambda_k \left(f(x + e_k) - f(x)\right) + \frac{\phi_k(x)}{\sigma_k} \left(f(x - e_k) - f(x)\right),$$

where $\phi_k(x)$ is the mean throughput of class $k$ in state $x$, given by (4).
By dominated convergence, $M_f^N(t)$ tends in distribution to:

$$M_f(t) = f(X(t)) - f(X(0)) - \int_0^t \Omega(f)(X(s)) \, ds,$$

and $M_f(t)$ is a martingale. Using the characterization of the Markov jump processes, we get that
the process $X(t)$ is a Markov process with infinitesimal generator $\Omega$.

This concludes the proof. □ 
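
The key step of this proof is that, with the flow counts frozen at $x$, the schedule process settles into its stationary distribution. The sketch below illustrates this on a deliberately simplified toy case and is not the multi-channel dynamics of the paper: it assumes a single channel, links that all interfere with each other, an attempt rate `x[k] * nu[k]` for link `k` when the channel is idle, and a release rate `mu[k]`. The names `nu`, `mu` and both function names are hypothetical. It checks that the product-form candidate, with weight 1 for the idle schedule and $x_k \nu_k / \mu_k$ for link $k$ active, satisfies the balance equations.

```python
def schedule_distribution(x, nu, mu):
    """Product-form candidate: index 0 is the idle schedule, index k + 1 means link k is active."""
    weights = [1.0] + [n * v / m for n, v, m in zip(x, nu, mu)]
    total = sum(weights)
    return [w / total for w in weights]

def balance_residuals(x, nu, mu, pi):
    """Outflow minus inflow for each schedule; all entries vanish at stationarity."""
    attempt = [n * v for n, v in zip(x, nu)]
    idle = pi[0] * sum(attempt) - sum(p * m for p, m in zip(pi[1:], mu))
    busy = [pi[k + 1] * mu[k] - pi[0] * attempt[k] for k in range(len(x))]
    return [idle] + busy

# Toy values: two links with 2 and 1 ongoing flows, unit attempt and release rates.
x = [2, 1]
nu = [1.0, 1.0]
mu = [1.0, 1.0]
pi = schedule_distribution(x, nu, mu)
residuals = balance_residuals(x, nu, mu, pi)
print(pi, residuals)
```

In this toy case the link with twice as many flows holds the channel twice as often, which is the flow-aware behaviour the averaged generator relies on.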

Proof of Proposition 2 For this proof, we will need the notion of fluid limits. A fluid limit is
a limiting point $\bar{X}^N(t)$ of the laws of the processes $\{X^N(nt)/n, n \ge 1\}$ in the set of probability
measures on the space $\mathcal{D}_{\mathbb{R}^K}([0, \infty))$ of càdlàg functions with values in $\mathbb{R}^K$ with the Skorohod topology
(see [4]). It is not difficult to show that the set of processes $\{X^N(nt)/n, n \ge 1\}$ is tight in the set
of probability distributions on the space $\mathcal{D}_{\mathbb{R}^K}([0, \infty))$ endowed with the metric associated to the
uniform norm on compact sets. Therefore, there exists at least one fluid limit and any fluid limit
is continuous. Since the process $Y^N(nt)$ has its values in a finite space for all $n \ge 1$, it can be
proved as in [8, 21] that, if there exists a deterministic time $T > 0$ such that $\bar{X}^N(t) = 0$ for all
$t \ge T$, then the Markov process $(X^N(t), Y^N(t))$ is ergodic.

The proof is then very similar to that given in [9] for random capture algorithms. We consider
a fluid limit $\bar{X}^N(t)$ and define:



i—i x ' 



When some class in $C_l$ takes channel $j$, all other classes in $C_l$ can take this channel while all classes
in $\{1, \ldots, K\} \setminus C_l$ cannot. This implies:

$$W^N(t) \le \max\left(0, \ 1 + \left(\sum_{l=1}^L \max_{k \in C_l} \frac{\rho_k}{\varphi_k} - J\right) t\right).$$

In the case of $L$-partite networks, the capacity region is given by the set of vectors $\phi$ such that:

$$\sum_{l=1}^L \max_{k \in C_l} \frac{\phi_k}{\varphi_k} \le J.$$

Since $\rho$ lies inside the capacity region, we have $W^N(t) = 0$ for all $t \ge T$, with

$$T = \frac{1}{J - \sum_{l=1}^L \max_{k \in C_l} \rho_k / \varphi_k},$$

which implies the ergodicity of the Markov process $(X^N(t), Y^N(t))$. □
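
As a rough numerical illustration of the emptying time in this proof, the sketch below assumes that the bound reads $T = 1 / (J - \sum_l \max_{k \in C_l} \rho_k / \varphi_k)$ for an $L$-partite network with parts $C_1, \ldots, C_L$. The function names, the argument `rate` (for $\varphi_k$) and the sample values are hypothetical.

```python
import math

def normalized_load(rho, rate, parts):
    """Sum over the parts C_l of the largest ratio rho_k / rate_k within the part."""
    return sum(max(rho[k] / rate[k] for k in part) for part in parts)

def emptying_time(rho, rate, parts, channels):
    """Time after which the fluid workload is zero, or infinity if the load is not below J."""
    slack = channels - normalized_load(rho, rate, parts)
    return 1.0 / slack if slack > 0 else math.inf

# Toy bipartite example: classes 0 and 1 form one part, class 2 the other, J = 2 channels.
rho = [0.5, 0.25, 0.5]
rate = [1.0, 1.0, 1.0]
parts = [[0, 1], [2]]
print(emptying_time(rho, rate, parts, channels=2))
```

With a single channel the same traffic sits on the boundary of the assumed capacity region, so the function returns infinity, consistent with the requirement that $\rho$ lie strictly inside the region.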
Proof of Proposition 4 Define the throughput vector $\tilde{\phi}$ such that $\tilde{\phi}_3(x) = \phi_3(x)$ and $\tilde{\phi}_k(x) =
1_{\{x_k > 0\}}$ for all $k \ne 3$. It can be easily verified that $\tilde{\phi}_k(x) \ge \phi_k(y)$ for all states $x, y$ such that
$x \le y$ and all $k$ such that $x_k > 0$. Now consider the coupling of the stochastic processes $X(t)$ and
$\tilde{X}(t)$ describing the evolution of the queues for the throughputs $\phi$ and $\tilde{\phi}$, respectively, starting
from the same initial state $X(0) = \tilde{X}(0)$. It follows from the above monotonicity property that
$\tilde{X}(t) \le X(t)$ a.s. at any time $t \ge 0$. In particular, the transience or the null recurrence of $\tilde{X}(t)$
implies that of $X(t)$.

For the throughput vector $\tilde{\phi}$, queues 1, 2, 4, 5 are independent M/M/1 queues with load $\rho_1$.
If $\rho_1 \ge 1$, the Markov process $\tilde{X}(t)$ is null recurrent or transient. Note that (10) then reduces to
$\rho_3 > 0$.

Assume now that $\rho_1 < 1$. To prove the transience of $\tilde{X}(t)$, we use fluid limits. Since $\rho_1 < 1$ and
for $\tilde{\phi}$, queues 1, 2, 4, 5 are independent M/M/1 queues with load $\rho_1$, there exists some finite time
after which, for any initial conditions, the corresponding components of the fluid limit are null.
We then just have to consider the fluid limits with the initial condition $\bar{X}_3(0) = 1$ and $\bar{X}_k(0) = 0$
for all $k \ne 3$. In this case, Proposition 9.14 of [21, p. 241] applies and the fluid limit satisfies:

$$\bar{X}_3(t) = 1 + \left(\lambda_3 - \frac{\bar{\phi}_3}{\sigma_3}\right) t,$$

as long as this function is positive, where $\bar{\phi}_3$ is the throughput of link 3 averaged over the states
of other links. Since each other link is active with probability $\rho_1$, it follows from (9) that:

03 = \ P t ~ \ P \ - \ P \ + I- 

In particular, $\bar{X}_3(t)$ increases linearly to infinity whenever inequality (10) is satisfied and, according
to [17], the Markov process $\tilde{X}(t)$ is transient. □